Tracking Human Body Pose on a Learned Smooth Space
نویسندگان
چکیده
Particle filtering is a popular method used in systems for tracking human body pose in video. One key difficulty in using particle filtering is caused by the curse of dimensionality: generally a very large number of particles is required to adequately approximate the underlying pose distribution in a high-dimensional state space. Although the number of degrees of freedom in the human body is quite large, in reality, the subset of allowable configurations in state space is generally restricted by human biomechanics, and the trajectories in this allowable subspace tend to be smooth. Therefore, a framework is proposed to learn a low-dimensional representation of the high-dimensional human poses state space. This mapping can be learned using a Gaussian Process Latent Variable Model (GPLVM) framework. One important advantage of the GPLVM framework is that both the mapping to, and mapping from the embedded space are smooth; this facilitates sampling in the low-dimensional space, and samples generated in the low-dimensional embedded space are easily mapped back into the original highdimensional space. Moreover, human body poses that are similar in the original space tend to be mapped close to each other in the embedded space; this property can be exploited when sampling in the embedded space. The proposed framework is tested in tracking 2D human body pose using a Scaled Prismatic Model. Experiments on real life video sequences demonstrate the strength of the approach. In comparison with the Multiple Hypothesis Tracking and the standard Condensation algorithm, the proposed algorithm is able to maintain tracking reliably throughout the long test sequences. It also handles singularity and self occlusion robustly.
منابع مشابه
Monocular Tracking 3D People with Back Constrained Scaled Gaussian Process Latent Variable Models
Tracking 3D people from monocular video is often poorly constrained. To mitigate this problem, prior information can be exploited. In learning the prior stage, most algorithms think representing high-dimensional pose space in low-dimensional space as dimension reduction procedure, without considering the geometrical relation or time correlation in pose space. Therefore, the prior loses physical...
متن کاملMonocular Tracking of 3D Human Motion with a Coordinated Mixture of Factor Analyzers
Filtering based algorithms have become popular in tracking human body pose. Such algorithms can suffer the curse of dimensionality due to the high dimensionality of the pose state space; therefore, efforts have been dedicated to either smart sampling or reducing the dimensionality of the original pose state space. In this paper, a novel formulation that employs a dimensionality reduced state sp...
متن کاملHierarchical Approach for Articulated 3D Pose-Estimation and Tracking
In the recent years we presented a number of methods for a fully automatic pose estimation [5, 7] and tracking [6] of human bodies in 2D [5] and 3D [6]. Initialization and failure recovery in these methods are facilitated by the use of loose-limbed body model [7] in which limbs are connected via learned probabilistic constraints. The pose estimation and tracking can then be formulated as an inf...
متن کاملتخمین چنددوربینی حالت سه بعدی انسان با برازش افکنش مدل اسکلت سه بعدی مفصل دار در تصاویر سایه نما
Automatic capture and analysis of human motion, based on images or video is important issue in computer vision due to the vast number of applications in animation, surveillance, biomechanics, Human Computer Interaction, entertainment and game industry. In these applications, it is clear that 3D human pose estimation is an essential part. Therefore, its accuracy has a great effect on the perform...
متن کاملMulti-activity Tracking in LLE Body Pose Space
We present a method to simultaneously estimate 3d body pose and action categories from monocular video sequences. Our approach learns a lowdimensional embedding of the pose manifolds using Locally Linear Embedding (LLE), as well as the statistical relationship between body poses and their image appearance. In addition, the dynamics in these pose manifolds are modelled. Sparse kernel regressors ...
متن کامل